Grid Simulation Tools for Job Scheduling and Data File Replication
نویسندگان
چکیده
In recent years, grid computing has emerged as one of the main computational platforms to perform many extremely difficult and/or time-consuming tasks. The amount of data generated every second is in fact sometimes much more than what can be processed even by dedicated grids. This makes the design of optimum solutions to efficiently balance and deploy existing systems one of the main objectives/ bottleneck in grids. These computational platforms are, however, too complicated to estimate/gauge efficiently of any algorithm without actually testing it. Thus, because accessing real systems is almost impossible for many reasons, including cost and trust, simulation becomes one of the inevitable stages before actual deployment of any algorithm in this field. In this chapter, first several simulation tools are listed, and then the problem statement behind all these grids is mathematically modeled and presented. Simulation is one major step in modeling many real-world processes before their actual deployments. Proper simulations can provide an extensive study of a system and reveal its many unknown aspects before actual deployment, including, but not limited to, feasibility, behavioral, and performance analysis. Industrial processes, parallel and distributed systems, and environmental resources are among many that receive direct benefits from such simulations. Although simulations mean to 1
منابع مشابه
An Efficient Data Replication Strategy in Large-Scale Data Grid Environments Based on Availability and Popularity
The data grid technology, which uses the scale of the Internet to solve storage limitation for the huge amount of data, has become one of the hot research topics. Recently, data replication strategies have been widely employed in distributed environment to copy frequently accessed data in suitable sites. The primary purposes are shortening distance of file transmission and achieving files from ...
متن کاملData Replication-Based Scheduling in Cloud Computing Environment
Abstract— High-performance computing and vast storage are two key factors required for executing data-intensive applications. In comparison with traditional distributed systems like data grid, cloud computing provides these factors in a more affordable, scalable and elastic platform. Furthermore, accessing data files is critical for performing such applications. Sometimes accessing data becomes...
متن کاملImproving Data Grids Performance by Using Modified Dynamic Hierarchical Replication Strategy
Abstract: A Data Grid connects a collection of geographically distributed computational and storage resources that enables users to share data and other resources. Data replication, a technique much discussed by Data Grid researchers in recent years creates multiple copies of file and places them in various locations to shorten file access times. In this paper, a dynamic data replication strate...
متن کاملA New Job Scheduling in Data Grid Environment Based on Data and Computational Resource Availability
Data Grid is an infrastructure that controls huge amount of data files, and provides intensive computational resources across geographically distributed collaboration. The heterogeneity and geographic dispersion of grid resources and applications place some complex problems such as job scheduling. Most existing scheduling algorithms in Grids only focus on one kind of Grid jobs which can be data...
متن کاملJob Scheduling and Data Replication in Hierarchical Data Grid
Data Grid environment is a geographically distributed that deal with date-intensive application in scientific and enterprise computing. In data-intensive applications data transfer is a primary cause of job execution delay. Data access time depends on bandwidth, especially when hierarchy of bandwidth appears in network. Effective job scheduling can reduce data transfer time by considering hiera...
متن کاملReplication in Data Grid
Data Grid environment is a geographically distributed that deal with date-intensive application in scientific and enterprise computing. In data-intensive applications data transfer is a primary cause of job execution delay. Data access time depends on bandwidth, especially when hierarchy of bandwidth appears in network. Effective job scheduling can reduce data transfer time by considering hiera...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2012